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Abstract. In this paper we study the state complexity of catenation combined with symmetric difference. 
First, an upper bound is computed using some combinatoric tools. Then, this bound is shown to be tight 
by giving a witness for it. Moreover, we relate this work with the study of state complexity for two other 
combinations: catenation with union and catenation with intersection. And we extract a unified approach 
which allows to obtain the state complexity of any combination involving catenation and a binary boolean 
operation. 


1 Introduction 

The study of state complexity is a very active research area for about 20 years. Its starting point 
is usually dated from 1994 with [22] although some early related works can be cited (especially 
EKED- Initially, was studied the state complexity of individual operations (see, for example, 
mm and the survey [7]), then the complexity of combined operations was investigated since 
2007. Initiated by [19], it has resulted in numerous articles f [8lll21ll3| . for example). Among them, 
we pay special attention to [4] where is studied the combination of catenation with union and 
intersection. 

In their paper, the authors observe that "the state complexity of a combined operation is not 
simply a mathematical composition of the state complexities of its component operations". Indeed, 
the state complexity of a regular n-ary operation is a function giving, from n integers standing for 
the sizes of minimal and complete DFAs, the maximal states number of a minimal and complete 
DFA accepting the resulting language of the operation applied over n languages recognized by the 
DFAs whose sizes are given as inputs. And, not surprisingly, when combining two operations, the 
language obtained to reach the state complexity of the first applied operation is not necessarily a 
relevant input to reach the state complexity of the second operation. 

The state complexity of the catenation of a m-states DFA with a n-states DFA is (m — l)2 n + 
2 n_1 022]) and the state complexity of any binary, non trivial, boolean operation of a n-states 
DFA with a /estates DFA is np (UD- In 0 it is proven that the state complexity of catenation 
combined with intersection corresponds to the mathematical combination of these complexities: 
(m — l)2 np + 2 np_1 whereas, when catenation is combined with union, the state complexity falls 
down to (m — l)(2 n+p — 2 n — 2 P + 2) + 2 n+p ~ 2 . And the authors conclude their paper with the 
following question: "Why are the state complexity results on these two very similar combined 
operations so different ?" They also mentioned that "although there is only a limited number of 
individual operations, the number of combined operations is unlimited". 

In this paper we provide some answers to these remarks and interrogations. Indeed, we will see 
that the combination L\(L2 o L3) of catenation with any regular, non trivial, binary operation is 
equivalent to the combination L\(L op L') of catenation with either U, either D or © applied to 
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languages chosen among L2, L3 and their complements. That is, it is sufficient to know the state 
complexities of catenation combined with these three binary operations to obtain all other cases. 
Moreover, while studying the only missing combination of the three (catenation with symmetric 
difference), it will emerge a unifying approach for the calculus of the three state complexities giving, 
by the way, a satisfactory answer to the question concerning the so different obtained complexities. 

The article mainly focuses on calculus of the state complexity for the combination of catena¬ 
tion with symmetric difference. In particular, it will need some non trivial combinatorial tools in 
addition to the classical automata handling. The next section gives some definitions and notations 
about automata and combinatoric. Section [ 3 ] contains some specific constructions needed later and 
describes more precisely how our approach meets, in standardizing them, the works done for cate¬ 
nation combined with union and intersection. In Section [H we give an upper bound for the state 
complexity of L\(L,2 © L3), assuming the number of some particular objects can be computed. In 
Section [51 we show how to compute these objects, curiously connecting our work to well known 
numbers in combinatorial area. In Section |6j we build a witness proving the tightness of the bound 
given in Section [ 4 ] To achieve this work, we use the family of languages defined by Brzozowski in 

2 Background 

In all this paper, £ denote a finite alphabet. The set of all finite words over £ is denoted by £*. 
The empty word is denoted by e. A language is a subset of £*. The set of subsets of a finite set A 
is denoted by 2 A and jf^A denotes the cardinality of A. The symmetric difference is denoted by the 
symbol ©. We denote by l±l the union of disjoint sets. The symbol o denotes any boolean operation 
on languages. In the following, by abuse of notation, we often write q for any singleton {g}. 

A finite automaton (FA) is a 5 -tuple A = (A, Q, /, F, •) where £ is the input alphabet, Q is a 
finite set of states, / C Q is the set of initial states, F C Q is the set of final states and • is the 
transition function from Q x £ to 2 ^. A FA is deterministic (DFA) if = 1 and for all q £ Q, 
for all a e £, ^(g • a) < 1 . Let a be a symbol of £. Let w be a word of £*. The transition function 
is extended to any word by q ■ aw = IWg.a S' • w and q ■ £ = q. 

A symmetric use of the dot notation leads to the following definition. Let w-q = {g' | g € q'-w}. 
We extend the dot notation to any set of states S by S ■ w = Uses s ' w anc ^ w ' S = Uses w ■ s. A 
word w e £* labels a successful path in a FA A if / • w fl F 7^ 0 . 

In this paper, we assume that all FA are complete which means that for all g G Q, for all a G £, 
#(q • a) > 1 . A state g is accessible in a FA if there exists a word w G £* such that g G / • w. The 
language recognized by a FA A is the set of words labeling a successful path in A. Two automata 
are said to be equivalent if they recognize the same language. 

Let D = (Z, QdAd, Fd, •) be a DFA. Two states gi,g2 of D are equivalent if for any word w 
of £*, gi • w G Fp if and only if q-2 ■ w G Fp. Such an equivalence is denoted by q± ~ g2- A DFA is 
minimal if there does not exist any equivalent DFA with less states and it is well known that for 
any DFA, there exists a unique minimal equivalent one [ 9 ]. Such a minimal DFA can be obtained 
from D by computing the accessible part of the automaton D / r^j — (E,Q D /~,[i D ],F D /~ r ) where 
for any q G Qp, [q] is the ~-class of the state q and for any a G £, [g] • a = [g • a\. In a minimal 
DFA, any two distinct states are pairwise non-equivalent. 

Any nondeterministic finite automaton B = (Z 1 , Q , /, F , •) can be converted into an equivalent 
DFA A = (£,Q',q' 0 , F 1 , ■) by a classical algorithm called the subset construction [TTJ. The set of 
states is Q' = 2^, the initial state is q' 0 = /, and the final states are the subsets of Q containing a 
state of F, that is F' = {q' G 2 Q \ q' fl F 7^ 0 } . 
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Two other classical constructions are the catenation A ■ B and the cartesian product Ax B of 
two automata A = (Z, Q A , I A , F A , ■ A ) and B = (Z, Q B , I B , F B , - B ). 

The catenation A ■ B is the automaton (Z, Q = Q A U Q B ,I, F, •) defined by: 


/ = 



if I a FF a + 
otherwise, 


^ 1 A uiiici wiac, 

p _ f F a U F b if I B n F b 0, 
] F b otherwise, 

V(g, a) E Q x F, q ■ a = 


q- B a 

q- A al)I B 


if g G Qbi 

if q -a a D F a ^ 0, 

otherwise. 


A cartesian product Ax B is an automaton (F,Q = Q A x Q B , I A x I b ,F,-) such that 
V((gi, < 72 ), a) G Q x Z, (gi,g 2 ) ■ a = qi - A a x q 2 - B a. Specifying F, we recover classical opera¬ 
tions like the intersection (F = F A x F B in this case). 

The state complexity of a regular language L denoted by sc(L) is the number of states of its 
minimal DFA. Let C n be the set of languages of state complexity n. The state complexity of a unary 
operation ® is the function sc® associating with an integer n the maximum of the state complexities 
of (®L) for L G C n . A language L G C n is a witness (for <g>) if sc (< 8 >L) = sc®(n). This can be 
generalized, and the state complexity of a k -ary operation <g) is the fc-ary function which associates 
with any tuple (ni ,..., n k ) the integer max{sc(<g)(L 1 ,..., L k ))\Li G C ni ,\/i G [l,fc]}. Then, a 
witness is a tuple (L 1; ..., L k ) G (£ ni x • • • x C nk ) such that sc(<E)(L 1? ..., L k )) = sc®(ni,..., n k ). 

An important research area consists in finding witnesses for any {ri \,... ,n k ) G and for any 

combination of elementary operations. Obviously, 


Claim 1 The state complexity of an operation defined as a composition of more elementary ones 
is upper-bounded by the composition of the corresponding elementary state complexities. 


For example, let us consider the ternary operation ® defined for any three languages L 1 , L 2l L 3 
by <g)(Li, L 2 , 1/ 3 ) = L\ ■ (L 2 o L 3 ) and let h be its state complexity. Let /, g be the respective state 
complexities of ■ and o. For any three integers ni,n 2 ,n 3 , it holds h(ni,n 2 ,n 3 ) < f(ni,g(n 2 ,n 3 )). 
Moreover, following [4], if o = n then h(rii,n 2 ,n 3 ) = f(ni,g(n 2 ,n 3 )) whereas, h(ni,n 2 ,n 3 ) < 
f(ni,g(n 2 ,n 3 )) when o = U. 

In |S], Brzozowski defines a family of languages that turns to be universal witnesses for several 
operations. The automata denoting these languages are called Brzozowski automata. We need some 
background to define these automata. We follow the terminology of [6]. Let Q = {0,..., n — 1} be 
a set. A transformation of the set Q is a mapping of Q into itself. If t is a transformation and i an 
element of Q , we denote by it the image of i under t. A transformation of Q can be represented 
by t = [i 0 , ii, ■ ■ ■ i n - 1 ] which means that i k = kt for each 0 < k < n — 1 and i k G Q. A permutation 
is a bijective transformation on Q. The identity permutation of Q is denoted by 1 q. A cycle of 
length £ < n is a permutation c, denoted by (i 0 ,i 1 ,.. .ie-i), on a subset / = {*o,... ,ie-i} of Q 
where i k c = i k + 1 for 0 < k < £ — 1 and ii-\C = i®. A k-rotation is obtained by composing k 
times the same cycle. In other word, we construct a fc-rotation r k from the cycle (io,... fit-i) by 
setting ijr k = ij+k mod 1 for 0 < j < £ — 1. A grouping of two elements ij,i k G / is a ((k — j ) 
mod f?)-rotation letting i k unchanged and obtained by iterating (k — j ) mod £ times the cycle 
(*o, • • •, 4-i, 4+i, • • •, 4-i). Such a grouping sends ij to 4+i and 4 to 4- A transposition t = ( i,j ) 
is a permutation on Q where it = j and jt = i and for every elements k E Q \ {i, j}, kt = k. A 

contraction t — ( * J is a transformation where it = j and for every elements k G Q \ {?}, kt = k. 
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In any complete DFA (S,Q,i, F, ■), any word of E* induces a transformation over 0. Let 
a,b,c,d be distinct symbols of E. As an example of Brzozowski automata (see Figure [[]), let 
W n (a, b, c, d) = (A, Q n , 0, {n — 1}, •) where Q n = {0,1,..., n — 1}, the symbol a acts as the cycle 

( 0 , 1 ,..., n — 1 ), b acts as the transposition (n — 2,n — 1 ), c acts as the contraction and d 

acts as l Qn . 


b, c, d 


6, d 


b, c, d 


b, c, d 


c, d 


c, d 



Consider a discrete set of objects S. The most general way to define formal series is to consider 
families of complex numbers indexed by elements of S, A — (a 0 ) 0 es, written formally as infinite 
sums A ~ e C[[«5]], in order to endow the space C[[«S]] with an algebraic structure 

related to a question that we want to study. 

The notion of combinatorial classes (see e.g. 0 ) is suited for solving enumeration problems. 
Let us recall few basic definitions and properties. A combinatorial class is defined by a map a 
from a discrete set S (the objects we want to enumerate) to another discrete set T (the objects 
describing a certain statistic on S ) such that each set S\ = {o G S \ a(o) = A} is finite for any 
A e 7L The series associated to a is < 7 a = #«^a[A]. 

The most classical examples of such a construction are the (univariate) generating series. In 
this case, we have T = N, a represents the size of the objects of S and the generating series is 
cr a = X^neN [n]. In order to solve enumeration problems, we have to endow the space of series 
with a product. Typically, most of the enumerations deal with one of the three products 

• [n] • [m] = [n + m] (ordinary generating functions) 

• [n] • [m\ = ( n+ n m ) [n + m] (exponential generating functions) 

• [n] ■ [m] = [rim] (Dirichlet generating functions) 

according to our initial problem. Depending on the product, one chooses to map basic elements 
on the different functions: 


• [n] ~ x 11 (ordinary generating functions) 

• [n] ~ ^ (exponential generating functions) 

• [n] ~ -A (Dirichlet generating functions). 

The products on series implement convolution products on sequences: 

• Ordinary generating functions: (a n ) n eN * (AO neN — UA +?= cti(3j J (ordinary convolution) 

V ' /nGN 

• Exponential generating functions: 


(^n)nGN ^ (^n)nGN 



(i) 


(binomial convolution) 


4 











• Exponential generating functions: (a n ) n<sN \ {0 } * (Ai)n<sN\{o} = J2a=n a ^j ) (Dirichlet 

V / neN\{0} 

convolution). 

For instance, the binomial transform of a sequence is interpreted in terms of exponential gen¬ 
erating series by the multiplication by exp(x): 

«p(*>2>! = E (£ (”)“•) P) 

n>0 n>0 \ i =0 x ' / 

At the other end, we define the characteristic series S_ of a set S as S = a id, i.e. ^2 oeS o where 
Id(o) = o denotes the identity on S. In the sequel, we will use characteristic series of languages on 
an alphabet A"; these series belong to the algebra C((A)) which is endowed with the catenation 
product extended by distributivity and continuity. 

Between these two extremes, many configurations are possible. We consider the case where 
T = the set of the integers sequences with a finite support {i.e. all the elements of the 

sequence are zero except a finite number) indexed by the set X (the set of variables). The series 
are realized by commutative variables as follows: 

• U r-u rw X v l x ] (ordinary multivariate generating functions) 

• v ~ FLex (exponential multivariate generating functions) 

• v ~ EUy (Dirichlet multivariate generating functions). 

Alternatively, we can replace a vector v e N( x ) by a partition of an integer. A partition of n is 
a decreasing finite sequence A = [Ai,..., A*,] of strictly positive integers verifying Ai + ■ • • + A*, = n; 
we will denote this by A b n. The number of parts of A, k in this context, is also denoted by #A. 
The product of two partitions [Ai]. [A 2 ] equals the partition obtained by sorting the catenation of 
the two partitions in decreasing order. For instance set Ai = [2, 2,1] h 5 and A 2 = [3, 2,1] h 6 then 
[Ai].[A 2 ] = [3, 2, 2, 2,1,1], If A" = (xi, x 2 ,... } is a discrete alphabet the algebra generated by the 
elements [A] is isomorphic to the algebra obtained by endowing N (X) with the product [w] [ 7 /] = 
[v + v'] (this corresponds to the ordinary multivariate generating functions). The isomorphism 
sends explicitly A to the vector v such that n[x*] is the number of parts in A equal to i. For instance 
15,3,2,2,1,1] r'•u [2, 2,1, 0,1, 0,...]. So we can replace the statistic described by vectors of hb x * by 
statistic described by partitions. 

In the paper we will also manipulate set partitions of {1,..., n}. In the aim to avoid confusion 
we will denote (integer) partitions by the symbols A, A,, A', A' etc. and the set partitions by the 
symbols 7r, 1 q, 7r', 7 t' etc. If n is a set partition of a set E , we will denote tt N E. For any n' C n, 
we also denote n' Ct= E. When E = {1,..., n}, we will write n N n for it N E and n' Ct= n for any 
subset t\' of 7T. 

3 Preliminaries 

There exist 16 binary boolean functions that can be expressed only with the operators A, V and 
A acting on variables and their negation (see Table [T]) . Each function is related to a set operation 
based on the classical correspondence (A, V, — A) <—> (fl, U, — , ©). For simplicity, we only use 
the second list of symbols to denote both. 
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y <1 <1 y Al y y ^ 

<S) X^A^A^, 

0000000011111111 
0000111100001111 
0011001100110011 
0101010101010101 
Table 1. The 16 binary boolean functions. 


Let us consider three regular languages M, N, P with respective deterministic state complexity 
m,n,p > 3. We look at the state complexity of M(N o P ) where o is a boolean operation. 

Let A = (E,Q A = {0,1,..., m - 1}, 0, F A , •), B = (E, Q B = {q 0 , q 1: ..., q n _ J, g 0 , F B , •), 
C = (E, Qc = {r 0 ,ri, ..., r p _i}, ro, Fc, •) be the three minimal DFAs for M, N, P respectively 
and consider the DFA D = (A, Qd = Qa x 2 QbX ® c , F B , •) defined as follows: 


Id — 


(0,0) if 0 0 Fa 

(0, (q 0 , r 0 )) otherwise 

the set of final states F B depends on the operation o: a state (i, S) is final if S contains a couple 
(q,r) satisfying (q G F B ) o (r G Fc) and 

for a symbol a G E and a state (z, S) in D , the function • is defined by 
(i ■ a, S ■ a) if i ■ a ^ F A 

(i ■ a, S ■ a U {(go, A))}) otherwise, 
where the dot notation is extended to a set of couples by S ■ w — (J(</r)es(® ' w ) x ( r ' w ) an( ^ 


(z, S) ■ a = 


w 


^ = U 


(g,r)eS 


,W 


q) x (w ■ r). 


The automaton A being deterministic, the definition of D implies that the accessible part of 
the subset automaton of A ■ (B o C) is the accessible part of F, therefore L(D ) = M(N o P ). 

For any state (z, S), the set S can be seen as a tableau with n rows and p columns where any 
cell (J, k) is marked if and only if the couple of states (qj, r*,) is in S (see Figure[2|). In the following, 
for simplicity, when the dimensions are fixed, we assimilate a tableau with the set of its marked 
cells. 



Fig. 2. The tableau corresponding to S = {( 173 , r?), (qi, rs), (®, rs)} with n = 6 and p = 7 


Since the state complexity of catenation is sc.(m, m') = (m — l) 2 m ' + 2 m '~ 1 and the state 
complexity of a binary boolean operation o is upperbounded by sc 0 (n,p) = np , from Claim [1] their 
composition allows to upperbound the state complexity of M(N o P) by (m — l)2 np + 2 np ~ 1 . This 
bound is reached when o = fi [4j. A study of o = U has been done in the same paper, but the 
approach was not exactly the same. 

Indeed, they use the fact that the union iVUP can be performed without computing any 
cartesian product, but by computing the union of the states of the automata B and C. Thus, 
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the catenation of A with B U C is an automaton the determinization of which produces states of 
the form (j, T, T r ) , with j a state of A and T,T' two subsets of states of B and C respectively. 
We can also notice that if j is final, the sets T and T' must contain the initial state of B and 
C respectively, and therefore that T is empty if and only if T' is empty. Then the bound is 
(m — l)(2 n+p — 2 n — 2 P + 2) + 2 n+p ~ 2 (only reached when A has only one final state). 

The state complexities of M (N fl P) and of M(N U P) are known. Then, knowing the state 
complexity of M(N © P) will lead us to the knowledge of the state complexity of M(N o P) for 
any non degenerative boolean operator. Indeed, as seen in Tabled] there are 16 boolean operations 
over two languages N and P and 6 of them are degenerative (0, B*, N, P, N, P). All the others can 
be expressed using one of the three boolean operators D, U and © and the complement operator 
over the single language N or P. As the state complexity of the complement of N (obtained 
by interchanging the states finality of B) is the state complexity of N, the state complexity of 
M(N(BP) will give us the state complexity of M(NoP) for any boolean operator o. Indeed, having 
a witness for M(N © P), M(N U P) and M(N fl P) will give us a witness for any combination 
M(N o P) as N o P can be expressed through one of the three boolean operations applied to 
the complementation of N and/or P. For example suppose that the triple of automata A,B,C 
is a witness for the combination M(N fl P). We want to produce a witness for the combination 
M(N — P). First, notice that this combination is M(N HP). So the triple of automata A, B 
and C is a witness for M(N — P) where C is the automaton C in which states finality has been 
interchanged. 

The state complexity of M(N U P) can be reinterpreted as follows in the automaton D. Let 
(z, S') and (i,S') be two distinct states such that the couples ( q x ,r x >) and ( q y ,r y i ) are in S' and 
S = S' U {(q x , ry)}. Then the two states (z, S) and (z, S') are equivalent. Indeed, to separate these 
states, one has to find a word w such that (1) S' ■ w is equal to a set of couples which members 
are both non-final and (2) (q x , ry) ■ w leads to a couple of states at least one of the two is final. 
The fact that q x ■ w or ry • w is final is contradicting (1). So (z, S) and (z, S') are equivalent. 

Such equivalent states have tableaux with specific patterns. Indeed, the tableaux for S and S' 
contain the pattern of Figure [3](a) and Figure [3](b) respectively. None of them can be distinguish 
from the pattern of Figure [3](c). So the number of equivalent states is the number of undistin- 
guishable tableaux which are represented by the patterns of Figure [3] The number of tableaux not 
containing patterns of Figure |3](a) or Figure [3](b) is (2” — 1)(2 P — 1) + 1. 





Fig. 3. Three undistinguishable tableaux (a), (b), (c), for the union operator 


Indeed, one has to choose among n rows and p columns (at least one of each) and mark every 
cell at the intersection of the chosen rows and columns ((2 n — 1)(2 P — 1)) plus one configuration with 
no cell marked. We also have to count the same tableaux but with the cell (0, 0) marked (2 n_1 2 p ^ 1 
tableaux). These observations lead to the state complexity (m — l)((2 n — 1)(2 P — l) + l) + 2 n_1 2 p_1 = 
(?7 1 - l)(2 n+p - 2 n - 2? + 2) + 2 n+ P~ 2 of M(N U P). 
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As there exist languages M, N and P such that there are no undistinguishable tableaux for 
M(NnP), the state complexity coincides with the bound. The study of undistinguishable tableaux 
for M(N © P) is done in the next section. 

4 An upper bound for the state complexity of M(N ®p) 

In this section, we proceed as in the previous one by considering the same automata A , B, C and 
D , setting o = ©. Notice that a state [i, S ) of D is final if and only if S contains a couple ( q x ,r y ) 
such that either q x E Fb or r y E Fq but not both. 

As for the union (see the previous section), some particular states are necessarily equivalent. 
Let (i,S) and (i, S') be two distinct states such that the couples (g x ,rv), ( q x , r y ') and ( q y ,r y /) are 
in S' and S = S' Li {(q y , r x ')}. Then the two states (i,S) and [i, S') are equivalent. Indeed, if a 
word w separates (i, S) and (i, S'), then w sends q y in F B or r x > in F c but not both, sending [i, S) 
to a final state of D. This cannot be achieved without sending {i, S') to a final state of D, thus 
contradicting the separation by w. This is formally proved in the next lemma. 

Lemma 1. Let E={q x , q y } x {r x >, ry}. Let (q, r) E E and w E E* with (q-wE F B ) © (r • w E Fc) 
Then there exists (q', r’) E E such that 

( qr') ^ (, q , r) A (q ■ w E F B ) © (r' • w E F c ) (3) 

Proof. Without loss of generality, let us set ( q,r) = ( q x ,r x >). We consider only the case where 
q x ■ w E F b and ry ■ w ^ F c ~, the other case can be proved similarly. The couple 

{ (g y ,7v) if q y -w EF B 
(q y , ry) if q y ■ w F B A r y > ■ w E F c 
( q x , r y ’) otherwise 

satisfies (J3]) . a 

Such equivalent states imply undistinguishable tableaux as described below. Four distinct 
marked cells si, S 2 , S 3 and S 4 dehne a rectangle if there exist four integers x , x', y and y’ such that 
{si, s 2 , s 3 , s 4 } = {q x ,q y } x {ry,ry}. Three distinct marked cells Si, s 2 and s 3 form a right triangle 
if there exists an unmarked cells S 4 such that si, s 2 , S 3 and S 4 form a rectangle (See Figure |4] and 
Figure [5]). A tableau S is saturated if it does not contain any right triangle. For each tableau S, 
we dehne Sat (S') as the smallest saturated tableau containing S. We can notice that Sat(S) is the 
intersection of all saturated tableaux containing S. Its existence is ensured since the tableau with 
each cell marked is saturated. Its unicity is due to the fact that the intersection of two saturated 
tableau containing S is still a saturated tableau containing S. 









X 



X 









X 



X 










Fig. 4. A rectangle. Fig. 5. A right triangle. 

Alternatively, we can dehne saturated tableaux as the hnal step of a confluent rewriting process. 
We denote by S —> S' if S' is obtained from S by marking one cell which completes a right triangle 
in S into a rectangle in S'. The rehexive and transitive closure —>■ of —)• is a confluent partial order 
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whose maximal elements are the saturated tableaux. It follows that a property is preserved by 
the transformation —y if and only if it is preserved by the transformation — y. We illustrate this 
principle with two lemmas which will be used below. 

Lemma 2. Let S be a tableau where a cell ( j,k ) is marked. If a tableau S' contains no marked 
cell on row j (equivalently on column k) then Sat (S') 7 ^ Sat(S'). 

Proof. First remark that if T —> T' then T contains no marked cell on row j (resp. column k) if 
and only if T' contains no marked cell on row j (resp. column k). Indeed, T differs from T' in only 
one marked cell which completes a right triangle in T into a rectangle in T'. So, this cell belongs 
to a row and a column in T which both contain a marked cell. 

We deduce that if T —> T' then T contains no marked cell on row j (resp. column k ) if and 
only if T' contains no marked cell on row j (resp. column k). 

Let S and S' be two tableaux such that a cell (j, k) is marked in S and tableau S' contains no 
marked cell on row j (resp. on column k). Since S' A Sat (S'), Sat (S') contains no marked cell on 
row j (resp. column k). Obviously, this is not the case for Sat(S) where the cell ( j , k) is marked. m 

Lemma 3. For any state (z, S) of D, (z, S) G Fd if and only if (z, Sat(S)) G Fo- 

Proof. We define the set Fg of the tableaux which have a marked final cell, that is a cell belonging 
to a final row (a final row corresponds to a final state of Qb) or to a final column (a final column 
corresponds to a final state of Qc) but not to both. The statement is equivalent to S G F$ if and 
only if Sat(S) G F s . Let S and S' be two n X p tableaux. It suffices to prove that if S —* S' then 
S' G Fs implies S G F$. One has to discuss on the position (j, k) of the unique cell which is marked 
in S' and unmarked in S. 

1. If this cell is not final, then all the final cells in S are unchanged in S'. Hence, S G Fs since 
S' G F s . 

2. If this cell is final, on a final row, then there exists j' 7 ^ j and k' 7 ^ k such that (j', k ), (j, k ') 
and (f, k') are the positions of marked cells in S and S'. If the cells on (j', k) and (j, k') are 
not final then the row j' is not final and the column k' is final. Hence the cell on (j', k') is final. 

3. If this cell is final on a final column, this case is treated symmetrically to the previous one. 


As a direct consequence of Lemma [TJ we have: 

Lemma 4. Let i be a state of Qa and let S and S' be two tableaux such that S —y S'. Then the 
two states (z, S) and (z,S") of D are equivalent. 

As a consequence, any state (z, S) in D is equivalent to the state (z, Sat(S)). It follows 

Lemma 5. For all (z,S) G Qd we have (z, Sat(S)) G [(z, S)]. 

Let a n _ p be the number of saturated tableaux with n lines and p rows. Furthermore, if z is final 
in A, then S contains (qo, r 0 ) and consequently so does Sat(S). Let a' np be the number of such 
tableaux where the cell (qo, Tq) is marked. Consequently 

Theorem 1. sc(M • ( N © P)) < (m — 1 )a n # + a' np 

Proof. By definition, sc(M • ( N © P)) is the number of states of the minimal AFD recognizing 
M ■ (N © P), that is the number of states of the accessible part of F/~. Trivially, this automaton 
has less states than F>/~. From Lemma 0 Z)/~ has less than (m — l)a„ iP + a' np states. B 

The next section is devoted to explicit the numbers a U)P and a' n p . 
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5 Counting saturated tableaux 

According to previous sections, the notion of undistinguishability induces an equivalence relation. 
Each equivalence class of tableaux contains a unique saturated tableau. 

We first recall some facts about the classical example of the set partitions (Section 15.11) . The 
three next sections are devoted to the enumeration of the saturated tableaux. We proceed as 
follows: We first give an algebraic interpretation of the saturated tableaux in terms of free monoids 
(Section 15.211 . Hence, in Section [5.31 we investigate some properties involving the characteristic 
series of saturated tableaux and we deduce, in Section 15.41 the desired enumeration. 


5.1 Bell polynomials 

Enumeration of saturated tableaux is closely related to set partitions. Well known results about 
combinatorial objects and their generating series (Bell polynomials) are recalled here. Readers 
wishing to deepen their knowledge on set partitions should refer to mm- 

Complete multivariate Bell polynomials A n (di ,..., a n ,...), named in honor of Eric Temple Bell 
by Riordan |18] . are multivariate polynomials defined by their exponential generating series: 


_ +n 

^ ^ A n (ai, . . . , d n , . . .) 7 

n\ 

n> 0 


exp 



(4) 


_ +m 

y dm— 

m> 1 m ' 

Bell polynomials are themselves multivariate ordinary generating functions of set partitions. 
Indeed, if 7r = {7Ti,..., 7 t*,} is a set partition of {1,..., n) with ^7Ti > • • • > #7Tfc, we denote by 
\ n = ..., #7r fc ] the (integer) partition associated to 7r. We have 

4b(cq, • • • , d n , • • •) ^ ^ 'y ^ S\d \ 1 • • • d\ k 

tv \=n Ab n 

where S\ denotes the number of set partitions 7r such that = A. 

Example 1. For instance, we have A 4 (ai, a 2 , a 3 , a 4 ) = d\ + 6a 2 a 4 + 3a2 + 4a 3 a 4 + a 4 , since the 
partitions of ,4} are 



Att 

set partitions 

[1,1,1,1] 

{{1},{2},{3},{4}} 

[2.1.1] 

{{1,2}, {3}, {4}}, {{1,3}, {2}, {4}}, 
{{1,4},{2},{4}},{{2,3},{1},{4}}, 
{{2,4},{1},{3}}, {{3,4},{1},{2}} 

[2,2] 

{{1,2},{3,4}},{{1,4},{2,3}}, 
{{1,3},{2,4}} 

[3,1] 

{{1,2,3},{4}},{{1,2,4},{3}}, 
{{1,3,4},{2}},{{2,3,4},{1}} 

[4] 

{{1,2,3,4}} 


When each a* is an integer, one interprets it as the number of ways one can color a part of 
cardinal i (or equivalently, each a, represents the weight of each part of cardinal i). If each a* 


10 
















equals 1 then the complete Bell polynomial A n (ai,..., a n ,...) specializes to the Bell number B n 
which counts the number of set partitions of {l,...,n} i.e. A n (l ,..., 1,...) = B n . Hence, the 
(exponential) generating series of Bell numbers is 'Yh n > 0 B n ^ = exp(exp(t) — 1) and the first 
Bell numbers are 1,1, 2 , 5,15, 52, 203, 877,4140, 21147,115975,... (see sequence A000110 in [ 20 ]). 
From the previous remark, the number of set partitions of {1,..., n} with no part of size 1 equals 
A n ( 0 , 1 ,..., 1 ,...) and the associated generating function is 

_^ j-n 

V A n ( 0,1,..., 1, - - -)—r = exp (exp (t) - 1 -t). (5) 

nl 

n> 0 

The first values of A n ( 0,1,..., 1) are 1, 0,1,1,4,11,41,162, 715, 3425,17722, 98253,... (see se¬ 
quence A000296 in [ 20 ]). Noticing that exp(t) ^ n>0 A n ( 0,1,..., 1,.. .)^ = X^n>o we deduce 
(see ©) that the Bell numbers sequence is the binomial transform of the sequence (A n (0,1,..., l)) nG pj 
i.e. 

Bn = jz{' l )M 0 , 1 , ■■.,!,■■■)■ ( 6 ) 

When each part have the same weight x, the exponential generating function of B n (x) = 
A n (x, ...,x) is 

y^B n {x)— = exp(x(exp(t) - 1)). (7) 

nl 

n> 0 

Notice that each B n (x) is a degree n polynomial in x also called Bell polynomial in literature. 
Its expansion gives 

n 

B n (x) = S n , k x k , ( 8 ) 

k =1 

where S n j- is a Stirling number of second kind counting the number of ways of partitioning the 
set {1,... ,n} into k blocks (see sequence A008277 in [20]). We recover the Bell numbers setting 
x — 1, that is B n = B n { 1). The numbers r n = B n {— 1) are the Rao Uppuluri-Carpenter numbers 
[21 ] whose first terms are 1, —1, 0,1,1, —2, —9, —9, 50, 267,413, —2180, —17731, —50533,... (see se¬ 
quence A000587 in [ 20 ]). From (J7]) the generating function of the r n ’s is 

yv n —= exp(l-exp(t)). (9) 

nl 

n> 0 

Equality (JHD gives r n = J^^ = 1 (— l) k S n ^ and provides the combinatorial interpretation of the 
Rao Uppuluri-Carpenter numbers: r n is the number of set partitions of {1 ,... ,n} with an even 
number of parts minus the number of such partitions with an odd number of parts. 

5.2 Monoids of tableaux 

We consider the set 7 " n of n rows tableaux. It contains a special tableau e n with no column and we 
endow it with a product • such that T ■ T' is the tableau obtained by gluying T' to the right of T. 
Notice that T ■ e n — e n ■ T = T. So, since the product is clearly associative, the set T n is a monoid 
and more precisely 

Lemma 6 . Let C n = {05 | S C {1,..., n}} be an alphabet indexed by the subset of {1,... ,n}. 
The monoid T n is isomorphic to C*. 
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Proof. Let T £ T n be a tableau with m columns. For any 1 < j < m we define Sj(T) as the 
set of indices i such that the ith cell of the column j is marked. Straightforwardly the map 
T — > Cg 1 (t) • • • c s p (T) is an isomorphism of monoid. B 

Example 2. Consider the following tableau and its associated word: 



C{l,4}C{3}C0C{i}C{ 3j 4} 

Consider the subset T n of C* consisting of all the words representing saturated tableaux with 
n rows. 

For each S Ci= n, we define the monoid M$ = ({c@} U {c s |s £ S})*. The following result char¬ 
acterizes the saturated tableaux. 

Lemma 7. A word belongs to T n if and only if it belongs to a monoid Ms for some S. 

Proof. Let w = c si ■ ■ ■c Sp be a word of T n . Then s, U Sj = 0 or s* = Sj for each i ^ j. Indeed, 
suppose that k £ s* D Sj and l £ s, such that t ^ k. Hence, if T denotes the tableau represented 
by w then the cells (k,i), ( k,j ) and (£,i) are marked. Since T is a saturated tableau, the cell (£, j) 
is also marked, ft follows that S w — {sj|l < i < p} C|= {1,..., n} and w £ Ms w . 

Conversely, if w £ Ms for some S C|= {1,... ,n} then w straightforwardly represents a saturated 
tableau. m 

Equivalently, 

T„ = (J M s . (10) 

S<Z^n 

Notice that this union is not disjoint since MsnS' — Ms D Ms>. 

5.3 Characteristic series of tableaux 

We consider the set M s = {w £ M s \w ^ M s > for any S' C S}. 

Example 3. For n — 2 we have 

• = c ; _ 

• £%}} = \ 

• ££{{2}} — ^f{{2}} \ ^h = ^{{2}}C{2}M{{2}} 

• = M {{1i2}} \ M 0 = 

• a '£{i>,{ 2}} = A/ £{i},{2}} \ (m$ U m {{1}} u m {{ 2}} j 

So we can rewrite (TTOli as a disjoint union T n = |^J Ms . The disjoint union implies that the 

SCi=n 

following identity on the characteristic series holds: 

Jn = Mi.' (H) 

Sc^n 
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Example 4- We have: Tg = Mg + Myyiyy + A<f{{ 2 }} + M{{ 12 }} + Myyiy^yy- In the following table we 
summarize the first terms of each Mg\ 

Mg = £ + C0 + Cg + . . . 

M{{1}} = C{1} + C0C{1} + C{1}C0 + C| X | + . . . 

M{{2}} = C{2} + C0C{2} + C{2}C0 + C| 2 | + . . . (12) 

M{{1,2}} = C {1,2) + C0C{i, 2 } + C{i i2 }C0 + <^ 12 j + . . . 

M {{!},{2}} = C (1} C {2} + C (2}C{1} + C| 1} C {2 } + C { i } C{ 2 }C { i } + C { i } c| 2} + C {2 }C| 1} + • • • 


Hence, 


Tg — £ + C0 + C{1| + C{2} + C{12} + c 0 + C {1} + C 0 C {1} + c {2} + C 0 C {2} + C{2}C0 + 
C {1,2} + C 0 C {1,2} + c {l,2} c 0 + C{ 1}C{2} + C{2}C{i} + . . . . 

This corresponds to the formal sum of the following saturated tableaux 

e c 0 c {1} C{ 2 } c { i, 2} 


0 


0 00 


0 


10 00 


0 0 


0 0 


00 0 


00 


0 0 




Cg C {1}C0 C0C{1} c| 1} C { 2}C 0 e 0 C {2 } c| 2} C 0 C {1,2} C{1,2}C0 C| 1>2} c {l} c {2} C {2}C{1} 

Notice also that each ic e Mg belongs to one and only one Mg' with S' C S. This gives 

Mg = ^ Mg/ . (13) 

S'CS 


Example 5. For S' = {{1}, {2}} equality ([T3j is M{{i} ){2 }} = M^ + My^yy + Myy 2 }} + -^{{i},{ 2 }}, and 
from (U2D, we recover: 

M{{ 1}i { 2 }} = ^ + C0 + C{1} + C{2} + C0 + C0C{1} + C0C{2} + C{1}C0 + C| 1 | + C{1}C{2} + C{2}C0 + C{2}C{1}+C| 2 | + - • • 

We define for each E C {1,..., n} the set 

T s = |J Mg = |J Mg. 

SC^E Sd^E 

As for the set T n , the last union is disjoint. So we have 

Je=J2ms- ( 14 ) 

SC^E 

We have the following equality for the characteristic series: 

Proposition 1. 

^Mg= ^ A n _ #£ ;(0,1,..1,.. -)T e (15) 

S\=n EC{l,...,n} 
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Proof. We define V n = Y!st=n Mg , support (S') = U^es-^ f° r an Y se ^ °f se ^ s & and ##>5 = 
^support(S'). When expanding V n on the Mg, we obtain: 

>i=EE M S ' = ^ #{S b n | S' C S'} Mg/ 

Shn S'CS S'C^n 

Observing that #{S b n \ S' C S'} equals the number of set partitions of {1,..., n}\support(S / ) 
we obtain 

K = B n _ ##s Ms. (16) 

SC|=n 

On the other hand, we define (a n )neN such that V n = n -\(in-#E^E- 

Expanding this equality on the Mg, we find 


Vn= Y. 


Q"n — 


n-#E 




E 


a n -#E M s 


Ec{l,...,n} SCi =E SCt=n \ support(S)C-EC{l,...,ra} 

n-##S 


Observing that an ~# E = 

support (S') C-E7c{l,...,n} ^=0 


n - ##S^j 


«n-##g-i , we obtain 



Comparing the coefficient of Mg in (fl6l) and (117)1 . we hnd Bk = Yli =o (i) a *- That is: Bell 
numbers are the binomial transform of the af s. From (|6]) we deduce a* = Ai( 0,1,..., 1,...) and 
our statement. ■ 


Example 6. For n — 2, on the left hand we have 

M{{i},{2}} + M{{ 1 , 2 }} = 2 Mg + Tf{{i}} + M{{ 2 }} + M{ji} > {2}} + M{{i j2 }} 


and on the right hand side we have 

A 2 (0, 1,..., 1,.. .)Tg + Ai(0,1,..., 1,...) (j\ i} + r {2} ) + -A 0 (0,1,..., 1,.. .) T{i ;2 } = 
Tg + 7 {i j2 } = 2 Mg + M{{i|} + M {{2}} + M {{ i } i{2}} + M{{i i2 }|. 


5.4 Generating series of saturated tableaux 

Let us first enumerate the elements of Mg with respect to the number of columns (length of the 
word) and the number of marked cells in the asociated tableaux. Noting that the characteristic 
series of Mg is given by 


E 


w = 


1 ( c 0 + 


(18) 


w£Ms v v 1 '-^E^S 

we obtain the bivariate generating series sending each letter Ce to the monomial xt* E 


m s (x,t) = r ijX H3 i 
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where k f ■ denotes the number of words w = Ce 1 ■ ■ -Ce^ in Ms of length i and weight j£Ei + • • • + 
#Ei = j. Since the map sending each letter ce to xt* E is a morphism from Q((Cg)) to Q[[x, t]], 
the right hand side of (fT5|) gives 


m s (x,t) 


1 

1 - (1 + P s {t))x 


(19) 


where P s (t) = T,Ees t#E - 

Example 7. Let S = {{1,4}, {2}, {5}}. We have 

A%’ 4 }’{2},{5}} — i_( C0+C{lj4} _|_ C{2} _|_ C{s} ) 

= e + (c0 + C|i 4| + C{2} + C{5}) + (c0 + C|i 4| + C{2} + C{5}) 2 + ••• 

= e + C 0 + C{ lj4 | + C{ 2 } + C{ 5 j + C 0 + C 0 C{ lj4 } 

+C 0 C{ 2 } + C 0 C{ 5 } + C{i j 4 }C 0 + cJ lj4} + C{i j 4 }C{ 2 } + C{i j 4 }C{ 5 } 

+C{2}C0 + C{ 2 }C{i )4 j + C| 2 | + C{ 2 }C{ 5 } 

+C{5} C 0 + C {5} C {1,4} + C{5}C{ 2 } + C| 5 j + • • • 

Furthermore Ps(t ) = 2t + t 2 , so we have 

ra{{i,4},{2},{5}}0*5 £) = Y^ETT+~2t + t 2 )r = 1 + ^ + + + (-*- + ^ 2 + 4t 3 + t 4 )x 2 H- 

For instance, we deduce = 6 corresponding to the 6 words: C 0 C{i j4 }, C{i i 4 }C 0 , c 2 2 |, 

c {2} c {5}) c {5} c {2} an d c { 5 j- 

In fact, the series ms(x,t ) depends only on the size of the sets of S. Let A = [A 4 ,..., A&] 
be a partition, we define P\(t) = Y2i=i^ Xi an d m x{t) = 1 _( 1+ p A ( f )) t . • If S — {Ei,...,Ek} with 
#Ei > ■ ■ • > we have 

m s (x, t) — m\ g (x, t) ( 20 ) 

with A s = [#£i, 

Denote by Tab n (x,t) the generating function of the saturated tableaux with n rows 

Tab n (x, t) = y, a r , n . p x p t J , 

m, j 

where (Xj. n>p denotes the number of saturated tableaux with n rows, p columns and j marked cells. 

So, the coefficient of x p in Tab n (x,t) is a polynomial ot np (t) which is the generating series of the 
7i x p saturated tableaux counted by the number of marked cells. 

The series Tab () ,(x, t) is the image of T n by the morphism ce —> xt* E . Notice that the image of 
Tp is also Tab # E (x,t). 

Hence Equation (fT5l) becomes 

y m s (x,t) = y A n - #E (0, 1, ■ ■ ■, l)Tab #B (x,£). ( 21 ) 

Shn Ec{l,...,m} 

The number of set partitions S' of {1,..., n} such that Xs = A equals with A! = (IT^(A*!)) (11^(multi(A)!)) 
where mult j (A) denotes the multiplicities of the part i in A. 
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Example 8 . Consider A = [2,1,1,1], The number of set partitions of S such that Xs = A equals 


5! 


= 10. The corresponding partitions are: 


(2!1! 3 )(1!3!) 


{{1,2}, {3}, {4}, {5}} 
{{1,4},{2},{3},{5}} 
{{2,3},{1},{4},{5}} 
{{2,5},{1},{3},{4}} 
{{3,5},{1},{2},{4}} 


{{1,3}, {2}, {4}, {5}} 
{{1,5}, {2}, {3}, {4}} 
{{2,4},{1},{3},{5}} 
{{3,4},{1},{2},{5}} 
{{4,5},{1},{2},{3}}. 


Furthermore, the number of subset of {1,..., n} with a fixed size i is equal to ("). Hence from 
(12T1) we obtain 



On the right hand of this equality, we recognize the binomial convolution of the sequence (A;(0,1,..., 1)); 
with the sequence (Tab n _j(x, £))* (see Equality ([1])). Interpreting (1221) in terms of exponential gen¬ 
erating functions we set 



and 



n =0 


Hence, using (J5]), Equality (1221) becomes 


m(x, t, z) = exp(exp(z) — z — l)Tab(x, t, z ). 


(23) 


So, 


Tab(a;, t, z) = m(x, t , z) exp(— exp(z) + z + 1) = —m(x, t, z)— exp(l — exp (2)). (24) 


dz 


We recognize the exponential generating function of the Rao Uppuluri-Carpenter numbers (J2j) . As 


the coefficient of x p z n in the left hand-side and the right hand-side of (1241) are equal, we obtain 


Theorem 2. The generating function of the n x m saturated tableaux counted by numbers of 
marked cells is 



(25) 


Example 9. Let the first values of the polynomial a n , p (t): 


• a hp (t) = (1 + t) p , 

• a 2 ,p{t) — (1 + 2 t) p + (1 + E) p — 1, 

• a 3 ,p{t) = (3t + l) p + 3 (t + t 2 + l) p — 3(1 + t) p + (t 3 + l) p — 1, 

• onjt) = 2 + (1 + 4 t) p + 6(1 + 2 1 + t 2 ) p + 3(1 + 2 t 2 ) p + 4(1 + t + t 3 ) p + (1 + t A ) p - 4(1 + t) p - 
6(1 + 2 t) p — 6(1 + t 2 ) p , 

• a 5iP (t) = 9 + 10(l + t) p + (5t + l) p +10(3t + t 2 + l) p -10(3t + l) p +15(t + 2t 2 + l) p -30(t + t 2 + l) p + 
10(2t +1 3 + l) p -10(1+ 2t) p +10(t 2 + t 3 + l) p -10(1+ t 2 ) p -10(t 3 + l) p + 5(t + t 4 + l) p +(t 5 + l) p 

For instance, the coefficient of t 9 in a 4 3 is 4. This corresponds to the tableaux 
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X 

X 

X 

X 

X 

X 




X 

X 

X 


X 

X 

X 




X 

X 

X 

X 

X 

X 


We illustrate the fact that a n>p (t) = a p , n (t) by remarking 


a 3A (t) = (3 1 + l) 4 + 3 (t + t 2 + l) 4 - 3(1 + t) 4 + (f 3 + l) 4 - 1 = 

a 4 , 3 (t) = 2 + (1 + 4 t) 3 + 6(1 + 2 1 + t 2 ) 3 + 3(1 + 2t 2 ) 3 

+4(1 + t + t 3 ) 3 + (1 + t 4 f - 4(1 + tf - 6(1 + 2 t) 3 - 6(1 + t 2 ) 3 = 

t 12 + 4 1 9 + 3 t s + 12 1 7 + 36 t 6 + 48f 5 + 135 1 4 + 148 1 3 + 66 1 2 + 12 1 + 1 


Setting t — 1 in a rym (t) we obtain the number of n x m saturated tableaux. 
Corollary 1. The number of n x m saturated tableaux is 


n 

®-n,rn ®n,m( 1) ^ ^ ^ ^ 

fc=0 w(A)<=n 
#A=fc 


t+A)+l 

A!(n — u;(A))! 


(* + 1)”\ 


(26) 


where a;(A) = k if A h k. 


Example 10. Let the first value of a n ^ m below: 


1 


1 

1 




1 

4 

12 



1 

8 

34 

128 


1 

16 

96 

466 

2100 

1 

32 

274 

1688 

9226 

1 

64 

792 

6154 

40356 

1 

128 

2314 

22688 

177466 


48032 

245554 1444212 

1251128 8380114 54763088 


Proposition 2. 

equals 

a 'n,p 


The number of tableaux having a specific marked cell in anx m saturated tableau 


1 d 

np dt 


®-n,p{t) 


t= 1 


— (n 


r n-i+l ST-' + -P\(1)) P 1 

''^(n-i)\^ A! 

*=0 v ' AH 


Proof. Let affp be the number of n x p triangle free tableaux such that the cell (i,j) is marked. 
Each cell having the same role, we have affp = On°p 0) = o/ . Hence 


a 


n,p 


^ n —1 p— 1 

= —YYa^ 

ryn Z_^ Z_^ n *P 

T i =o j =o 


1 

np 


Y.MCT) 


where the last sum runs over the n x p triangle free tableaux T and A4(T) denotes the number 
of marked cells in T. Notice that J^ t A 4(T) is the total number of marked cells in all the n x 
p triangle free tableaux. Acting by multiplies the coefficient of P in a niTn (t ) by j so that 
t^t a n,p(t) | ( _ 1 = fh a n,p(t) | t _ 1 is the number of marked cells in all the n x p tableaux. We deduce 

a n,p = 77 it a n,p(t) \ t=1 - m 
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Example 11. We list below the values of a' np for n = 1,..., 6: 

• = 2 p -\ 

• <4,p = s^- 1 + 2 P_1 , 

• a' 3tP = 4P- 1 + 3 • 3 p -\ 

• a^’ p = 5 P_1 + 6 • 4 P_1 + 4 • 3 P_1 - 3 • 2 p - x , 

• a ' 5tP = -7 • 2 P_1 + 6 P_1 + 10 • 5 P_1 + 19 • 4 p - x - 7 • 3 p - x , 

• a ' 6p = -59 • 3 -1+p + 20 • 4 p - x + 55 • 5 p - x + 7 p ~ l + 15 • 6 p -h 


For instance, consider the saturated 2x2 tableaux: 


X 


X XX 


XI 


X XX 


ax XL 


S XE 


XX X 
XX 

4 


X XI 


x 


We hnd a' 22 = |(l-0 + 4- l + 6- 2 + l-4) = 5 = 3 1 + 2 1 as expected by the formula. 


6 Witness 

In this section, we use the Brzozowski automata A = W m (a,c,b,d ), B = W n (a,b,c,d) and C = 
W p (d,b,c,a) (with m, n,p > 3) mentioned in Section [2j page |4] and recognizing three languages 
M, N and P. They are represented in Figure [6] where the red arrows suggest how to connect the 
automaton A to the automata B and C to recognize M ■ (N © P). 



Fig. 6. The automata A, B and C 


ft is obvious that A, B and C are minimal since they contain only one final state and, in any 
of them, one symbol acts as a cycle over the set of states. To prove (M, N, P ) to be a witness for 
the composition of catenation with symmetric difference, we have to prove that the DFA 77, built 
from A, B and C by following the construction described in Section 0 reaches the bound computed 
in Section [4j Some properties of A, B and C are useful. Especially, one can obtain rotations and 
groupings on their states reading some particular words. 

The following properties are true for W n (a, b, c, d), but they remain valid for the other automata 
by just permuting role of the symbols: 

— The word a k induces a k- rotation over Qb- 
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— The word ( ah )( k ~'A induces a grouping of q n -k and q n - 2 sending these two states to q n -i, q n - 2 
(with 2 < k < n). The word (ba induces another grouping which sends the states q n -k, 
q n -i to the states g 0 , g n _ 1 (with 2 < A: < n — 1). 

Remark 1. Any word in {a, b , d}* acts as a permutation on Qb x Qc■ Consequently for any word 
w G { a,b,d }* and any set S C Qb x Qc, |w.S| = |5| = \S.w\. 

Proposition 3. Each state of D is accessible. 

Proof. By induction over the structure of the states of D. The initial state is (0, 0) and any state 
of the form (i, 0) with i < m — 1 is reached from it by a 1 . 

Let a G [0, np — 1] and consider now any state (i, S ) with (A) = a + 1. We show its accessibility 
by cases, using only words in {a, b, d}* acting as permutations according to Remark [0 

1. If i — m — 1 then S = {(go, A))} W S' and the state ( i , S ) is reached by a from (m — 2, a ■ S'), 
itself accessible by induction hypothesis. 

2. If i — 0 then we have three cases to consider: 

(a) If S' = {(gi,r 0 )} W S' then (0, S ) is reached by a from (m — 1, {(go,T 0 )} W a ■ S') which is 
accessible by[Q 

(b) If S = {(gi,Tfc)} l±l S' then (0, S') is reached by d k from (0, {(gi,r 0 )} l±l d k ■ S') which is 
accessible by l2al 

(c) Otherwise, S = {(g?, r k )} l±) S'. Then the word w = (abb)^ -1 ^ mod n allows to reach (0, S) from 
(0, {(gi,r fc )} l±J w ■ S') which is accessible by [2bJ 

3. For any 0 < i < m — 1, the state (i, S) is reached by a‘ from (0, a 1 ■ S) which is accessible by [23 


As a direct consequence, 

Theorem 3. Each state of D /~ is accessible. 

So D/~ is the minimal DFA of M ■ (N © P). Furthermore, we know from Lemma [5] that any 
state [(i, S')] of D/~ contains ( i , Sat(S')). In order sc(M • (N © P)) to reach (m — l)a UtP + a ' np , 
it remains to prove the non equivalence of any two distinct saturated states of D, where a state 
(i, S) is saturated if S is a saturated tableau. 

Theorem 4. Any two distinct saturated states of D are non-equivalent. 

Proof. Let si = (A, Si) and S 2 = (A, S' 2 ) be two distinct saturated states. There are two cases 
to consider: If i\ 7 ^ i 2 , let us suppose i\ > A (without loss of generality). Then, using the word 
w = ccd p ~ 1 a m ~ 1 ~ n , we send si to (i\ , S[) and s 2 to ( i' 2 , Sf) with (g 0 , r 0 ) G Sat(S'()\Sat(S , 2 ), reducing 
this case to the other one, next described. For an explanation, the prefix cc acts as a contraction 
which allows, in particular, to exclude ri of S 2 with no move on A. Then the factor d p ~ l induces 
a rotation on C to exclude r$ of S 2 , still with no move on A. Last, the suffix takes A to 

m — 1, involving the emergence of the couple (go, To) i n S[, whereas the state r 0 remains excluded 
of S' 2 . Then, following Lemma[2j Sat(S'() 7 ^ Sat^). 

Now, we look at the other case. If Si 7 ^ S 2 , let (qj,rQ G Si\S 2 (without loss of generality). A 
state (A S) is final if and only if S holds a final couple of the form (g n _i,rfc) with k 7 ^ p — 1 or 
(gj, r p _ 1 ) with j 7 ^ n — 1. We will prove it is possible to build a word sending (g,, rg,) on such a final 
couple while sending any other couple on a non-final couple. And then we will be able to conclude 
following Lemma [3j 

First, we define the sets R = {rk'liqj^k') G S' 2 } and T = {gj}U{gy |(gj/, rv) G S 2 for some rv G 
R} and we show the following lemma 
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Lemma 8. One has T x R C S' 2 . 

Proof. Let ( q,r ) E T x R. If q = qj then the property is obvious. Else (qj,r) E S 2 and there exists 
r' E R such that ( q,r ') G S' 2 . If r = r' then again the lemma is straightforward. In the other cases, 
one has (qyr') G S 2 and since S 2 is saturated we also have (g,r) G S 2 . ■ 

Now, we partition the states of B and C using a coloration: we color in green the states of 
R U T, in black the other states appearing in S 2 plus the state r and in grey the states not used 
in S 2 (see Figure [3 for an illustration). In fact, the colors spot states on the automata B and C 
and, by reading a word, we move the colors on other states. 





Fig. 7. An example of coloration for states of B and C. 


So we associate a pair of colors to each couple in S 2 and Lemma [ 8 ] implies that any couple of 
S '2 is either green-green or black-black. The couple ( qj,Vk ), only present in Si, is green-black. 

In the remaining of the proof, we compute a word sending 

1 . the couple ( qj,rk ) on a final couple of the form ( q n -i,rk ') with k! 7 ^ p — 1 , 

2 . all green-green couples on the non-final (g n _i,r p _i) and 

3. all black-black couples on non-final couples of the form (qy, tv) with j' 7 ^ n — 1 and k' 7 ^ p — 1. 

Clearly, such a word separates si from s 2 , sending the first one to a final state (because of point 
[T]) and the second one to a non-final state (because of points [2] and [3]) . For instance, the colors in 
Figure |3 become as in Figure EJ 



Fig. 8 . A configuration distinguishing the states which correspond to the coloration of Figure [7] 


To prove the existence of a word satisfying the three points above, we proceed by induction 
on the number of green states. For each step we will use the notions of rotation and grouping 
described in Section [2] in order to decrease the number of green states. The difficulty lies not only 
in decreasing this number, but also in avoiding to increase it inadvertently. Indeed, we have to look 
at the effect of a word over three automata simultaneously. So, to convince the reader, we first list 
the 6 movements which could increase the number of green states. More precisely, there are only 
two ways to increase the number of green states: ( 1 ) use a letter c (acting as a contraction) which 
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makes green states appear by saturation, independently of the current state of A; (2) use a letter 
which moves the current state in A to m — 1. 

First, we say that a couple of states (on a same automaton) is uncontractable if one is green 
and the other is black. Consider an uncontractable couple over Qb (or Qc )• If we read a word 
which sends both states on a same state then new green positions could appear (those connected 
with the black position before reading the word). Regarding this problem, two movements must 
be avoided: 

(mvtl) - Reading c when (q 0 , q\ ) is uncontractable. 

(mvt2) - Reading c when (ro, n) is uncontractable. 

Next, we say that a couple of states (on two distinct automata) is sensitive if one is green and 
the other is not green. Consider a sensitive couple of Qb x Qc- If we read a word sending it to 
(g 0 , r 0 ) while we, simultaneously, access to the final state of Qa, then the new couple appearing 
connects a green position with a non green one. Since, from Lemma 0 and by saturation, the green 
positions always constitute a biclique, the non green state and all those connected to it become 
green. In relation with this problem, four movements must be avoided: 

(mvt3) - Reading a when (g n _i,ro) is sensitive and the current state on Qa is m — 2. 

(mvt4) - Reading b when (qo,r o) is sensitive and the current state on Qa is m — 1. 

(mvt5) - Reading c when one couple in {qo,qi} x {r 0 ,r!} is sensitive and the current state on Qa is 
m — 2. 

(mvt6) - Reading d when (qo,r p -i) is sensitive and the current state on Qa is m — 1. 

Now, we start the induction. By definition, the state qj is green. If it is the only one, then two 
situations may occur: 

i/ If there exists a grey state rv £ Qc then the word d p ~ l ~ k ' a n ~ 1_:? does the job, with a first 
rotation over Qc to convey the grey state on r p _i and a second one over Qb to send the green 
state on q n -\. If the reading of the cfs from s 2 induces (mvt6)-movements (here, this implies 
j — 0), we add the prefix a to the previous word and the suffix a n-1 ~ J becomes a n ~ 2 . 
ii/ If there is no grey state in Qc then we create one by making a contraction c which ensures rq to 
be immediately grey, and we come back to the previous case. If this contraction from s 2 induces 
a (mvtl)-movement, we first read an a (if j = 1) or two a’s (if j = 0). Observe that no (mvt3)- 
movement can appear since there is only one green state (which cannot be, simultaneously, on 
q n - 1 and go or qi)- Similarly, if the contraction from s 2 induces a (mvt5)-movement, we read 
the prefix aa before the c. 

If there are two green states then, necessarily, one is qj and the other belongs to R. We proceed 
exactly as for case i/ with the second green state playing the role of the grey state r*./. 

Now, suppose there are a + 1 green states with a > 2. First, observe neither T nor R can be 
empty. We have two cases to consider. 

— If R contains at least two states. Among them, we denote r fc / and ry with k' 2 > k \, those having 
the biggest indices. We make a rotation to send r fc / on r p _i, then a grouping to have both green 
positions on r 0 and r p _i and last, a contraction to decrease the number of green states and 
solve the problem by induction hypothesis. So, the word w = d p ^ 1 ~ k2 (bd) k ^ kl dc does the job 
when i 2 £ Qa \ {m — 2, m — 1} . 

If i 2 — m — 1, the word w has to be prefixed with an a to avoid (mvt4) or (mvt6)-movements. 
If z 2 = m — 2, we first read some d’s depending on the color of q n -\. If it is black or grey, we 
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read d p ~ k to send r*, on ro, coloring it in black. If it is green, we read dP~ k ' 2 to send ry on ro, 
coloring it in green. In both cases, we can now read an a from m — 2 being sure the states q n -\ 
and r 0 are not sensitive, and so avoid (mvt3)-movement. Then, we read a last a to reach the 
state 0 before reading w, where k[ and k' 2 are adjusted in consequence (they are respectively 
replaced by two new indices k” > k", where {k[, k' 2 } is sent to {k'{, k 2 } when reading the prefix 
above). 

— If R contains only one state then T contains at least two. We denote them q.y^ and qy 2 with 
n — 1 > j 2 > j\ or j\ — n — 1 (that is is nearest to q n -2 than qy when reading only a’s) and 
proceed similarly to the previous case (but not identically). We make a rotation to send on 
q n ~ 2 , then a grouping to have both green positions on q n ~2 and q n ~i and last, a contraction to 
decrease the green states number and solve the problem by induction hypothesis. So, in most 
of the cases, the word w = a n ~ 2 ~- 72 (ab) j2 ~ 3 ' babbabc does the job. The possible problems which 
can happen are: 

1. a (mvt5)-movement if we read the final c from state m — 2 ; 

2. (mvt3)-movements when reading an a from state m — 2 ; 

3. (mvt4)-movements when reading a b from state m — 1 ; 

4. a (mvt2)-movement when reading the final c. 

To solve the first problem, we ensure we are on position 0 onto the first automaton before 
reading the suffix babbabc. For this, we have to read sufficiently many ab factors in w. That is, 
if we are not on position 0 after reading the prefix a n ~ 2 ~ 32 (ab) 32 ~ 31 then we continue to read 
some ad s: any large enough (for example, larger than m) multiple of n — 1 is a good candidate. 
Notice that this solution may induce occurrences of problems (2) and (3), which are treated 
below. 

To solve the second problem, the solution consists, each time we have to read an a in the position 
m — 2, to read some d. s before the a. This allows to color r 0 in green or black accordingly to the 
current color of q n ~\ to ensure q n -\ and r 0 to be not sensitive. This is always possible since we 
have at least one black position (coming from ry.) and one green position in Qc as mentioned 
above (R is not empty). 

Remark that the reading of some d’s produces no (mvt6)-movement (because we are on state 
m — 2 in Qa )• And the adjonction of the cPs has no incidence regarding to the solution we have 
given for the first problem. 

Observe that the third problem has disappeared since if we are on state m — 1 when reading 
a b, then we just come from m — 2 by reading an a. As previously mentioned, we first ensured 
that q n ~i and ro were not sensitive. But this involve, after reading the a, that qo and ro are 
now not sensitive, hence avoiding a (mvt4)-movement. 

Last, to avoid the fourth problem, we proceed accordingly to the following (exhaustive) cases: 

• If ro and r p _i are contractable then read d before the final c. This action sends ro to r\ 
and r p _i to ro with no move on the other automata. This avoid a (mvt2)-movement while 
reading the final c. 

• If r 0 is uncontractable with r p _i, but is contractable with r p _2 then read bdb before the final 
c (the last b is necessary when n — 3 which implies q± = q n ~ 2 )- This action sends ro to r\ 
and r p _2 to ro with no move on the other automata. 

• If r 0 is uncontractable with r p _i and r p _ 2 (that is, the latter two are contractable) then read 
dd before the final c. 


Therefore, 

Theorem 5. sc(M • (N © P )) = (m — 1 )a n , P + oi' np 
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